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1 Data base directions: the next ste ps | 
John L. Berg 

November 1976 ACM SIGMOD Record , ACM SIGMIS Database, Volume 8 , 8 issue 4 , 2 
Publisher: ACM Press 

Full text available: "gj pdf (9.95 MB) Additional Information: full citation, abstract 

What information about data base technology does a manager need to make prudent 
decisions about using this new technology? To provide this information the National 
Bureau of Standards and the Association for Computing Machinery established a workshop 
of approximately 80 experts in five major subject areas. The five subject areas were 
auditing, evolving technology, government regulations, standards, and user experience. 
Each area prepared a report contained In these proceedings. The proceedings p ... 

Keywords: DBMS, auditing, cost/benefit analysis, data base, data base management, 
government regulation, management objectives, privacy, security, standards, technology 
assessment, user experience 



2 Using GATE as an environment for teaching NLP Q 
Kalina Bontcheva, Hamish Cunningham, Valentin Tablan, Diana Maynard, Oana Hamza 

July 2002 Proceedings of the ACL-02 Workshop on Effective tools and methodologies 
for teaching natural language processing and computational linguistics - 
Volume 1 

Publisher: Association for Computational Linguistics 

Full text available: pdf(439.53 KB) Additional Information: full citation , abstract , references 

In this paper we argue that the GATE architecture and visual development environment 
can be used as an effective tool for teaching language engineering and computational 
linguistics. Since GATE comes with a customisable and extendable set of components, it 
allows students to get hands-on experience with building NLP applications. GATE also has 
tools for corpus annotation and performance evaluation, so students can go through the 
entire application development process within its graphical develop ... 

3 Ja panese lan g uage handling in APL environments B 
Kyosuke Saigusa 

July 1998 ACM SIC APL API Quote Quad , Proceedings of the APL98 conference on 
Array processing language APL '98, volume 29 issue 3 
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Publisher: ACM Press 

Full text available: pdf(555,47 KB) Additional Information: full citation , abstract , references , index terms 

While Japan is amongst the first countries to use APL back in the early 1970's and 
enjoyed at one time as high as 40% of IBM main-frame customers using APL, the lack of 
Japanese language support of APL in popular computing environments today such as 
UNIX/AIX, OS/2 and Windows seems to be driving APL out of the scene.This paper tries to 
bring up this subject more officially than I have done in the past for more public 
discussion not only for Japan, but also for those countries, where the characte ... 

Information retrieval on the web 
Mei Kobayashi, Koichi Takeda 

June 2000 ACM Computing Surveys (CSUR), volume 32 issue 2 
Publisher: ACM Press 

Full text available - ^ Ddf(213.89 KB) Additional Information: full citation , abstract, references , dtings. index 
^ terms 

In this paper we review studies of the growth of the Internet and technologies that are 
useful for information search and retrieval on the Web. We present data on the Internet 
from several different sources, e.g., current as well as projected number of users, hosts, 
and Web sites. Although numerical figures vary, overall trends cited by the sources are 
consistent and point to exponential growth in the past and in the coming decade. Hence it 
is not surprising that about 85% of Internet user ... 

Keywords: Internet, World Wide Web, clustering, indexing, information retrieval, 
knowledge management, search engine 



5 Pen computing: a technology overview and a vision 
Andre Meyer 

^ July 1995 ACM SIGCHI Bulletin, volume 27 issue 3 
Publisher: ACM Press 

Full text available: '^ pdf(5.14 MB) Additional Information: full citation , abstract , citin gs, index terms 

This work gives an overview of a new technology that is attracting growing interest in 
public as well as in the computer industry itself. The visible difference fronn other 
technologies is in the use of a pen or pencil as the primary means of interaction between ' 
a user and a machine, picking up the familiar pen and paper interface metaphor. From 
this follows a set of consequences that will be analyzed and put into context with other 
emerging technologies and visions. Starting with a short historic ... 

6 Im plementing SMART for minicomputers via relational processin g With abstract data 

<|> ty^ees 

^ Edward A. Fox 

October 1981 Proceedings of the 1981 ACM SIGSMALL symposium on Small systems 
and SIGi^OD workshop on Small database systems 

Publisher: ACM Press 

Full text available: ® pdf(948.46 KB) Additional Information: full citation , abstrict. references , citings, index 

terms 

Designed during the 1960's as a research tool for the field of information retrieval, the 
SMART system has been operating on an IBM 370 since 1974. SMART is now being 
enhanced, redesigned, and programmed under the UNIX operating system [28] on a DEC 
VAX 11/780. The techniques used should allow real-time operation on smaller 
minicomputers in the PDF 11 family. The implementation provides for a combination of 
database and Information retrieval operations which make it applicable to office aut ... 
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A storage and access manager for ill-structured data 
Jeffrey Kottemann, Michael Gordon, Jack Stott 
August 1991 Communications of the ACM, volume 34 issue 8 

Publisher: ACM Press 

Full text available: "SI pdf(2 04 MB) Additional Information: full citation , abstract, references . iodexternQ.s. 

Database management systems are powerful tools for processing large volumes of 
structured, or normalized, data. Much of the data to be stored in computer systems, 
however, differs from normalized data in both its logical uses and the storage structure • 
required for its effective management. For instance, Van Rijsbergen (1979) distinguishes 
database retrieval from information retrieval (IR)— the retrieval of references to text— by 
c ... 

8 A Unicode based adaptive se g nnentor 

Q. Lu, S. T. Chan, R. F. Xu, T. S. Chiu, B. L LI, S. W. Yu 
' July 2003 Proceedings of the second SIGHAN workshop on Chinese language 
processing - Volume 17 
Publisher: Association for Computational Linguistics 

Full text available: '^ pdf(114.08 KB) Additional Information: full citation , abstract , references 

This paper presents a Unicode based Chinese word segmentor. It can handle Chinese text 
in Simplified, Traditional, or mixed mode. The system uses the strategy of divlde-and- 
conquer to handle the recognition of personal names, numbers, time and numerical , 
values, etc in the preprocessing stage. The segmentor further uses tagging information to 
work on disambiguation. Adopting a modular design approach, different functional parts 
are separately implemented using different modules and each module ta ... 

9 Al phabet Sou p 
Stephen Turnbull 

March 1999 Linux Journal 

Publisher: Specialized Systems Consultants, Inc. 

Full text available: jg| html ( 40.76 KB ) Additional Information: full citation , abstract , references , index terms 

The Internationalization of Linux, Part 1: Mr. Turnbull takes a look at the problems faced 
when different character sets and the need for standardization 

Determinin g a computin g center environment 
^ Robert F. Rosin 

July 1965 Communications of the ACM, volume 8 issue 7 

Publisher: ACM Press 

Full text available:'gl Ddf(676.32 KB) Additional Information: full citation , abstract, references, citings, index 

An investigation is described in which several generally unavailable parameters 
descriptive of a computing center environment are obtained. The actual data collection 
and reduction is described, and the results of one month of this collection are tabulated 
and summarized. 

Information stora g e and retrieval: a survey and functional description 
Jack MInker 

September 1977 ACM SIGIR Forum, volume 12 issue 2 
Publisher: ACM Press 

Full text available: ^ pdf(5.14 MB ) Additional Information: full citation , abstract , references 

Information Storage and Retrieval (IS&R) encompasses a broad scope of topics ranging 
from basic techniques for accessing data to sophisticated approaches for the analysis of 
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natural language text and the deduction of information. Within the field, three general 
areas of investigation can be distinguished not only by their subject matter but also by 
the types of individuals presently interested in them:(l) Document retrieval, (2) 
Generalized data management, and(3) Question-answering.A functional ... 

Keywords: automatic indexing, data management, data structures, deductive search, 
Information retrieval, natural language, problem solving, question-answering, relational 
data systems, theorem proving 



12 Current issues in software en g ineering for Natural Langua g e Processing 
Jochen L. Leidner 

May 2003 Proceedings of the HLT-NAACL 2003 workshop on Software engineering 
and architecture of language technology systems - Volume 8 SEALTS '03 

Publisher: Association for Computational Linguistics 

Full text available: pdf( 226.48 KB) Additional Information: full citation , abstract , references 

In Natural Language Processing (NLP), research results from software engineering and 
software technology have often been neglected.This paper describes some factors that 
add complexity to the task of engineering reusable NLP systems (beyond conventional 
software systems). Current work in the area of design patterns and composition 
languages is described and claimed relevant for natural language processing. The benefits 
of NLP componentware and barriers to reuse are outlined, and the dichotomies ... 

13 Live documents with contextual, data-driven information components 
Anke Weber, Holger M. Kienle, Hausi A. Muller 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation 

Publisher: ACM Press 

,- .1* ^ ui HA i^Dx Additional Information: full citation , abstract , references , citin gs, index 
Full text available: pdf(627.10 KB) ^ 

We Introduce the notion of a live document and we describe our concept of live 
documents with contextual, data driven information components. The dynamic and 
interactive features of live documents provide a consistent data source for multimedia 
presentations targeted to various audiences and multiple platforms. Therefore, they 
contribute to the solution of key challenges in single sourcing and repurposing. We 
motivate the use of live documents with sample scenarios from the field of systems 
docu ... 

Keywords: Microsoft Office, live documents, repurposing, reverse engineering, scalable 
vector graphics, single sourcing, software engineering, systems documentation 



Ap plications of data structures: Data structures for computer graphics Q 

Marshall D. Abrams 

February 1971 ACM SIGPLAN Notices, volume 6 issue 2 
Publisher: ACM Press 

Full text available: '^ pdf(2.49 MB) Additional Information: full citation , abstract , references 

This paper introduces data structures as applied to computer graphics. Design criteria for 
computer graphics .data structures are discussed, followed by a comparison of general- 
purpose and tailored graphic data structures. A general graphic data structure is 
introduced as an example of a structure meeting the preceding criteria. The language is 
then examined as a tool for implementing the above data structure, and is compared to a 
few other language systems. 
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15 Research in music and artificial intelligence 
Curtis Roads 

N/ June 1985 ACM Computing Surveys (CSUR), volume 17 issue 2 
Publisher: ACM Press 

r- ^ •. 1.. .sn ^*/o7oikiiDv Additional Information: full citation , abstract , references, citings , index 

Full text available: 'm pdf (2.72 MB) ^ T 

^.t^y terms , review 

Although the boundaries of artificial intelligence (AI) remain elusive, computers can now 
perform musical tasks that were formerly associated exclusively with naturally intelligent 
musicians. After a historical note, this paper sermonizes on the need for AI techniques in 
four areas of musical research: composition, performance, music theory, and digital 
sound processing. The next part surveys recent work Involving AI and music. The 
discussion concentrates on applications in the four areas o ... 

1^ Security Mechanisms in High-level Network Protocols 
^ victor L. Voydock, Stephen T. Kent 

>^ June 1983 ACi^ Computing Surveys (CSUR), volume 15 issue 2 
Publisher: ACM Press 

Full text available: W{ pdf(3.23 MB) Additional Information: full citation , references , citings 



''^ The Inte g rated Dictionary/Director y S ystem Q 

^ Frank W. Allen, Mary E. S. Loomis, Michael V. Mannino 

^ June 1982 ACM Computing Surveys (CSUR), Volume 14 issue 2 

Publisher: ACM Press 

Full text available: 'g|pdf (271 MB) Additional Information: full citation , references , citings , index terms 



18 Extendin g Java for high-level Web service construction 
^ Aske Simon Christensen, Anders M0ller, Michael I. Schwartzbach 

November 2003 ACM Transactions on Programming Languages and Systems 

(TOPLAS), Volume 25 Issue 6 

Publisher: ACM Press 

Additional Information: full citation , abstract , references , citings. Index 



Full text available: ^ 

^ terms 

We incorporate Innovations from the <bigwig> project into the Java language to provide 
high-level features for Web service programming. The resulting language, JWIG, contains 
an advanced session model and a flexible m.echanism for dynamic construction of XML 
documents, in particular XHTML. To support program development we provide a suite of 
program analyses that at compile time verify for a given program that no runtime errors 
can occur while building documents or receiving form input, and ... 



Keywords: Interactive Web services, XML, data-flow analysis 



19 A comparison of messa g e passing and shared memory architectures for data parallel Q 

^ pro g rams 

^ A. C. Klaiber, H. M. Levy 

April 1994 ACM SIGARCH Computer Arciiitecture News , Proceedings of the 21ST 

annual international symposium on Computer architecture ISCA '94, volume 

22 Issue 2 

Publisher: IEEE Computer Society Press, ACM Press 

Full text available: pdf(1 .26 MB) Additional Information: full citation , abstract , references , citings, index 
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Shared memory and message passing are two opposing communication models for 
parallel multicomputer architectures. Comparing such architectures has been difficult, 
because applications must be hand-crafted for each architecture, often resulting in 
radically different sources for comparison. While it is clear that shared memory machines 
are currently easier to program, in the future, programs will be written in high-level 
languages and compiled to the specific parallel target, thus eliminating thi ... 

20 A case study in the use of data base nnanagement 
A. M. Koss, J. M. Noonan 

October 1976 Proceedings of the annual conference 

Publisher: ACM Press 

Full text available: pdf(397.30 KB) Additional Information: full citation , abstract , references , index ternns 

Data base management systems have been presented as the means for improving 
information accessibility and consolidating data processing. In a non-dedicated computer 
environment the resource allocation that such systems require must be considered. Based 
on more than four years experience with data base management systems, this paper 
analyzes the efficiencies and computer resource usage of a data base management 
system with random access in comparison with the efficiencies and computer resour ... 
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[PDF] Abstract 

File Format: PDF/Adobe Acrobat - View as HTML 

Hamish defines information, extraction as a process that takes texts (and sometimes 
speech) as input and produces fixed-format,, unambiguous data as output ... 
www.ifla.org/IV/ifla72/papers/097-ZhixiQng_Sa_Zhengxin Ying-en.pdf - Similar p a g es 

Created by RL Brown 

Problems with free and fixed format ASCII files disappear with the ASCII Data Dictionary 
Builder. DBMS/Copy features direct Microsoft Access support for ... 
www.son.wisc.edu/rdsu/data%20course/data%20course.html - 23k - 
Cached - Similar p a ges 



Python Unicode Integration Proposal Version: 1.8 ,., 

These allow processing of the data in chunks to efficiently use memory. „. The constructor 
does not check the data for Unicode compliance or use of ... 
www.egenix.com/files/python/unicode-proposal.txt - 39k - Cached - Similar pages 

PEP 100 - Python Unicode Integration 

These allow processing of the data in chunks to efficiently use memory. ... internal format 
for Unicode objects should use a Python specific fixed format ... 
www.python.org/dev/peps/pep-0100/ - 52k - Cached - Similar pag es 

Glossary of terms and abbreviations 

A number of supplied nodes provide basic processing such as input and output. ... that is 
composed of a number of fixed format data structures or elements, ... 

publib.boulder.ibm.com/infocenter/wmbhelp/ 

v6r0m0/topic/com.ibm.etools.mft.doc/ax99997_.htm - 80k - Cached - Similar p ages 
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File Format: PDF/Adobe Acrobat - View as HTML 

using basic tagging and a standard ASCII or Unicode encoding will retain its value. ... In 
particular, fixed format data must never contain XML tagging, ... 
www.turnkey.com.au/tksweb/papers/XML2001T.pdf - Similar pages 

[PDF] Enterprise Data Publishing with Oracle Reports: Any Data. Any .„ 

File Format: PDF/Adobe Acrobat - View as HTML 

In a modern IT environment, data is produced in different, places and different formats. ... 
with Unicode documents, displaying different character-sets ... 

www.oracle.com/technology/products/reports/10g/reports_technicalWP.pdf - Similar pages 
UTD: Universal Text Data format 

But the general idea of submitting some data to some processing could be ... since 
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NIST Fingerprint Data Interchange Workshop Summary 
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